ABSTRACT

We present a method for studying the proximity effect and the density structure around
redshift $z = 2-3$ quasars. It is based on the probability distribution of Lyman-α pixel optical
depths and its evolution with redshift. We validate the method using mock spectra obtained
from hydrodynamical simulations, and then apply it to a sample of 12 bright quasars at
redshifts 2-3 observed with UVES at the VLT-UT2 Kueyen ESO telescope. These quasars do
not show signatures of associated absorption and have a mean monochromatic luminosity of
$5.4 \times 10^{31}\,h^{-2}$ erg s$^{-1}$ Hz$^{-1}$ at the Lyman limit. The observed distribution of
optical depth within 10 $h^{-1}$ Mpc of the QSO is statistically different from that measured in
the general intergalactic medium (IGM) at the same redshift. Such a change results from the
combined effects of the increase in photoionization rate above the mean UV-background due to
the extra ionizing photons from the quasar radiation (proximity effect), and the higher density
of the IGM if the quasars reside in overdense regions (as expected from biased galaxy
formation). The first factor decreases the optical depth whereas the second one increases it,
and our measurement cannot separate the two: a higher background can be traded against a
lower overdensity. An overdensity of the order of a few is required if we use the amplitude of
the UV-background inferred from the mean Lyman-α opacity. If no overdensity is present, then
we require the UV-background to be higher, and consistent with the existing measurements
based on standard analysis of the proximity effect.

Key words: Methods: data analysis - N-body simulations - statistical - Galaxies: intergalactic 
medium - haloes - structure - quasars: absorption lines 



1 INTRODUCTION 

The hydrogen Lyman-α absorption lines of the 'Lyman-α forest', seen in the spectra of
distant quasars, are a powerful probe of the physical conditions in the intergalactic medium
(IGM) at high redshifts ($1.8 < z < 6$). It is believed that most of the lines with column
density $N_{\rm HI} \lesssim 10^{14}$ cm$^{-2}$ originate in quasi-linear density fluctuations in
which the hydrogen gas is in ionization equilibrium with a meta-galactic UV background
produced by star-forming galaxies and quasars. Non-linear effects are unimportant, and the
properties of the Lyman-α forest are therefore described well by just three basic ingredients:
quasi-linear theory for the growth of baryonic structure, a UV radiation field, and the
temperature of the gas (Bi 1993; Muecket et al. 1996; Bi & Davidson 1997; Hui, Gnedin &
Zhang 1997; Weinberg 1999; Choudhury, Srianand & Padmanabhan 2001a; Choudhury,
Padmanabhan & Srianand 2001b; Schaye 2001; Viel et al. 2002a). This paradigm is impressively
confirmed by full hydrodynamical simulations (Cen et al. 1994; Zhang, Anninos & Norman
1995; Miralda-Escude et al. 1996; Hernquist, Katz & Weinberg 1996; Wadsley & Bond 1996;
Zhang et al. 1997; Theuns et al. 1998; Machacek et al. 2000; see e.g. Efstathiou, Schaye &
Theuns 2000 for a recent review).

* Based on observations collected at the European Southern Observatory
(ESO), under the Large Programme "The Cosmic Evolution of the IGM"
ID No. 166.A-0106 with UVES on the 8.2 m KUEYEN telescope operated
at the Paranal Observatory, Chile.

In photoionization equilibrium, the optical depth, $\tau$, is related to
the overdensity of the gas, $\Delta = \rho/\langle\rho\rangle$, by

$$ \tau \propto \Delta^{2}\, T^{-0.7}/\Gamma_{12} \propto \Delta^{2 - 0.7(\gamma - 1)}/\Gamma_{12}. \qquad (1) $$

Here, $\Gamma = \Gamma_{12} \times 10^{-12}$ s$^{-1}$ is the hydrogen photo-ionization rate
and $T(\Delta)$ the temperature of the gas. The associated transmission
$F = \exp(-\tau) = F_\lambda/F_{\rm c}$ is the observed flux ($F_\lambda$) divided by the
estimated continuum flux ($F_{\rm c}$). Photo-ionization heating and cooling by adiabatic
expansion introduce a tight relation $T = T_0\,\Delta^{\gamma - 1}$
in the low-density IGM responsible for the Lyman-α forest (Hui &
Gnedin 1997; Theuns et al. 1998). The above equation has been extensively
used, especially to probe the matter clustering (Hui 1999;
Nusser & Haehnelt 1999; Pichon et al. 2001; Viel et al. 2002b;
Croft et al. 2002; McDonald 2003; Rollinde et al. 2003).
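
As a rough illustration of Eq. (1), the following Python sketch evaluates the exponent
$2 - 0.7(\gamma - 1)$ and the overdensity that would compensate a given increase of the
photo-ionization rate. Only the proportionality in Eq. (1) is used, so all optical depths
are relative, and the function names are hypothetical choices for this example.

```python
def tau_density_slope(gamma):
    """Exponent of Delta in Eq. (1): 2 - 0.7 (gamma - 1)."""
    return 2.0 - 0.7 * (gamma - 1.0)


def relative_tau(delta, gamma, gamma12):
    """Optical depth up to a constant factor: Delta**slope / Gamma_12."""
    return delta ** tau_density_slope(gamma) / gamma12


def compensating_overdensity(rate_factor, gamma):
    """Overdensity that leaves tau unchanged when Gamma_12 is multiplied by rate_factor."""
    return rate_factor ** (1.0 / tau_density_slope(gamma))


if __name__ == "__main__":
    # Range of gamma values quoted for the temperature-density relation.
    for gamma in (1.0, 1.25, 1.5):
        print(gamma, tau_density_slope(gamma), compensating_overdensity(4.0, gamma))
```

For an isothermal gas ($\gamma = 1$) the exponent is 2, so a fourfold increase of the
ionization rate is offset by an overdensity of 2. This is the degeneracy between radiation
and density that the rest of the paper has to deal with.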
The UV-background that causes the photo-ionization is dominated
by massive stars and quasars (Haardt & Madau 1996; Giroux
& Shapiro 1996). The amplitude of the corresponding photo-ionization
rate as a function of redshift, and the relative importance
of the different sources, are relatively uncertain. Fardal,
Giroux & Shull (1998) have derived the H I and He II photoionization
history by modelling the opacity of the IGM, using high-resolution
observations of H I absorption. They find $\Gamma_{12} = 1-3$ at
redshift $z = 2-4$. Haardt & Madau (2001) have combined models
for the emissivity of galaxies and quasars with calculations of the
absorption of UV photons in the IGM, and estimate $\Gamma_{12} \sim 1-2$ at
redshift $z = 2-3$. More recent observations suggest that Lyman
break galaxies may dominate the UV-background at $z = 3$ (Steidel
et al. 2001). In simulations, assuming a standard Big Bang baryon
fraction, the value of $\Gamma_{12}$ has to be between 0.3 and 2 at
redshift $z = 2-3$ in order to reproduce observed Lyman-α forest
properties, such as the mean transmission and the column density
distribution (Hernquist et al. 1996; Miralda-Escude et al. 1996; Rauch et
al. 1997; Zhang et al. 1997; Choudhury et al. 2001a; Haehnelt et al.
2001; McDonald & Miralda-Escude 2001, erratum 2003; Hui et al.
2002; Tytler et al. 2004; Bolton et al. 2005).
An independent way of estimating $\Gamma$ is the proximity effect.
Locally, the UV-field may be dominated by a single source, such as
a bright quasar, leading to a deficit of absorption lines sufficiently
close to the quasar. Because the amount of absorption in general
increases with redshift, this reversal of the trend for redshifts
close to the emission redshift of the quasar is called the 'inverse'
or 'proximity' effect (Carswell et al. 1982; Murdoch et al. 1986).
The strength of this effect depends on the ratio of ionization rates
from quasar and UV-background, and since the quasar's ionization
rate can be determined directly, $\Gamma_{12}$ can be inferred. This method
was pioneered by Bajtlik, Duncan & Ostriker (1988), but more recent
data have yielded a wide variety of estimates (Lu, Wolfe &
Turnshek 1991; Kulkarni & Fall 1993; Bechtold 1994; Cristiani et
al. 1995; Fernandez-Soto et al. 1995; Giallongo et al. 1996; Lu et
al. 1996; Srianand & Khare 1996; Cooke, Espey & Carswell 1997;
Scott et al. 2000, 2002; Liske & Williger 2001). Scott et al. (2000)
collected estimates from the literature which vary over almost an
order of magnitude at $z = 3$, i.e. $1.5 < \Gamma_{12} \lesssim 9$.
In the standard analysis of the proximity effect it is assumed that the
matter distribution is not altered by the presence of the quasar. The
only difference between the gas close to the quasar and far away is
the increased photoionization rate in the vicinity of the QSOs. An
important consequence is that the strength of the proximity effect
should correlate with the luminosity of the quasar, but such a
correlation has not been convincingly established (see Lu et al. 1991;
Bechtold 1994; Srianand & Khare 1996; see however Liske &
Williger 2001). It is in fact likely that the quasar will be in an
overdense region. Indeed, the presence of Lyman-α absorption lines
with redshift $z_{\rm abs}$ greater than the quasar redshift $z_{\rm em}$ suggests
possible excess clustering of the IGM material around QSOs (Loeb &
Eisenstein 1995; Srianand & Khare 1996). Furthermore, in hierarchical
models of galaxy formation, the super-massive black holes
that are thought to power quasars are in massive haloes (Magorrian
et al. 1998; Marconi & Hunt 2003; Haring & Rix 2004), which
are strongly biased to high-density regions. If the accretion rate in
quasars is close to the Eddington limit, then it seems plausible that
the IGM density close to the quasar is significantly higher than the
mean.

Recent studies of the transverse proximity effect by Croft (2004) 
and Schirber, Miralda-Escude & McDonald (2004) also suggest 
excess absorption over that predicted by models that assume the 
standard proximity effect and isotropic quasar emission. If this is 
not due to an increase in density close to the quasar, it might im- 
ply that the quasar light is strongly beamed, or alternatively that the 
quasar is highly variable. Interestingly, neither of these affects the 
longitudinal proximity effect discussed in this paper. 
Observations of the IGM transmission close to Lyman break galaxies
(LBGs) show that the intergalactic medium contains more neutral
hydrogen than the global average at comoving scales
$1 < r < 5\,h^{-1}$ Mpc (Adelberger et al. 2003). As the UV photons from
the LBGs cannot alter the ionization state of the gas at such large
distances, it is most likely that the excess absorption is caused by
the enhanced IGM density around LBGs. It is worth noting that
various hydrodynamical simulations have trouble reproducing this
so-called galaxy proximity effect (e.g. Kollmeier et al. 2003; Bruscoli
et al. 2003; Maselli et al. 2004; Desjacques et al. 2004). If a
similar excess of density around quasar host galaxies exists and is
not taken into account, then a determination of $\Gamma$ from the proximity
effect will be biased high.

In this paper, we present a new analysis of the proximity effect
of very bright quasars observed as part of the ESO-VLT Large
Programme (LP) 'Cosmological evolution of the Inter Galactic
Medium' (PI Jacqueline Bergeron). This new method allows one to
infer the density structure around quasars. The method is based on
the cumulative distribution function (CDF) of pixel optical depth,
$\tau$, and so avoids the Voigt profile fitting and line counting
traditionally used. Using $\tau$ instead of the transmission, $F = \exp(-\tau)$, has
the great advantage that we can take into account the strong redshift
dependence $\langle\tau\rangle \propto (1 + z)^{\alpha}$ with $\alpha \approx 4.5$.

We begin by briefly describing the data used in this paper. We outline
the procedure in Section 3 and illustrate it using hydrodynamical
simulations in Section 4. The application to the high signal-to-noise,
high-resolution spectra of the ESO-VLT Large Programme
is described in Section 5. Our analysis requires that the
density be higher close to the quasar. Results and future prospects
are discussed in Section 6. Throughout this paper, we assume a flat
universe with $\Omega_m = 0.3$, $\Omega_\Lambda = 0.7$ and $h = 0.7$.

2 THE DATA 

2.1 The LP quasar sample 

The observational data used in our analysis were obtained with the
Ultra-Violet and Visible Echelle Spectrograph (UVES) mounted
on the ESO KUEYEN 8.2 m telescope at the Paranal observatory
for the ESO-VLT Large Programme (LP) 'Cosmological evolution
of the Inter Galactic Medium' (PI Jacqueline Bergeron). This
programme has been devised to gather a homogeneous sample of
echelle spectra of 18 QSOs, with uniform spectral coverage, resolution
and signal-to-noise ratio suitable for studying the intergalactic
medium in the redshift range 1.7-4.5. Spectra were obtained
in service mode observations spread over four periods (two years)
covering 30 nights under good seeing conditions (< 0.8 arcsec).
The spectra have a signal-to-noise ratio of ~40 to 80 per pixel and
a spectral resolution > 45000 in the Lyman-α forest region. Details
of the data reduction can be found in Chand et al. (2004) and Aracil
et al. (2004). In our analysis we have only used absorption lines that
are between the Lyman-α and the Lyman-β emission lines of the
quasar.

[Plot: horizontal axis r (Mpc/h)]

Figure 1. Transmission $F = \exp(-\tau)$ as a function of luminosity distance for the LP QSOs listed in Table 1. The emission redshift, $z_{\rm em}$, is indicated between
brackets and increases from top to bottom. The evolution of the optical depth with redshift (see Section 3.4) is removed to compute the mean transmission
$F = \langle\exp(-\tau/\tau_0(z))\rangle$ as a function of luminosity distance (bottom panel). The proximity effect is clearly seen as an increase in mean transmission close to
the quasar.

Table 1. Properties of the Large Programme QSOs in our sample. The
redshift of emission (details are given in second to fourth columns) has
been determined using different emission lines. The luminosity, L, in $h^{-2}$
erg s$^{-1}$ Hz$^{-1}$ (last column) is computed assuming a $\Omega_m = 0.3$ flat
Universe and a spectral index of 0.5.

quasar         z_em (mean value)   z_em (used lines)       z_em (ref.)   log(L)
Q0122-380      2.203               Hα, Mg II               2             31.633
PKS1448-232    2.220               Hα, Mg II               2             31.527
PKS0237-23     2.233               Hα, Mg II               2             31.665
HE0001-2340    2.267               Mg II                   1             31.649
Q0109-3518     2.404               Mg II                   1             31.819
HE2217-2818    2.414               Mg II                   1             31.994
Q0329-385      2.440               Hα, Mg II               2             31.278
Q0453-423      2.658               Lyman-α, C IV, Si IV    3             31.709
PKS0329-255    2.736               C IV                    1             31.577
Q0002-422      2.767               Lyman-α, C IV, Si IV    3             31.721
HE0940-1050    3.068               C IV                    1             32.146
PKS2126-158    3.267               Lyman-α, C IV, Si IV    4             32.132

Average determinations of $z_{\rm em}$ are taken from Espey et al. 1989 (2), Bechtold
et al. 2002 and Srianand & Khare 1996 (3, using a correction factor
suggested by Fan & Tytler 1994), Tytler & Fan 1992 (4), or (re)done in this
paper (1, Section 2.1).

Six of the eighteen LP QSOs (HE1158-1843, HE1347-2457,
HE0151-4326, HE1341-1020, Q0420-388 and
HE2347-4342) show signatures of associated absorption close to
the emission redshift of the QSO, and are therefore excluded from
our analysis. The remaining twelve are listed in Table 1, which
gives the name of the QSO, its redshift, $z_{\rm em}$, and the monochromatic
luminosity at the Lyman limit (L).

An accurate determination of the emission redshift is important for
the analysis. Espey et al. (1989) have found that the Hα line is
redshifted by an average of 1000 km s$^{-1}$ with respect to lines from high
ionization species and has statistically a similar redshift to the lines
from the low ionization species. The mean difference between Hα
and Mg II redshifts in their sample is ~107 km s$^{-1}$ with a standard
deviation of ~500 km s$^{-1}$. A redshift measurement based on
Hα and other low ionization lines is available for 4 of the QSOs
(Espey et al. 1989, see Table 1). We consider the mean redshift of
all observed lines for these systems. When the Mg II emission line
is observed, as it is for three additional QSOs, we fit the profile
with the doublet of Mg II and a polynomial continuum to determine
the redshift accurately. Fig. 2 shows the results of this fitting
procedure for the three QSOs. On average, these redshifts should
be within an rms of 500 km s$^{-1}$ of the systemic redshift. For 2
of the QSOs, Bechtold et al. (2002) and Srianand & Khare (1996)
used the C IV, Si IV and Lyman-α lines to determine the redshift
of emission, and applied the correction factor suggested by Fan &
Tytler (1994). Otherwise, we use the C IV emission line for two
other QSOs and the determination from Tytler & Fan (1992) for the
last remaining QSO. Therefore 7 out of 12 redshifts of the QSOs
in our sample are determined accurately using the Hα or Mg II
emission line, and 2 using the correction factor from Fan & Tytler
(1994).
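
To give a sense of scale for these redshift uncertainties, the Python sketch below converts
a velocity offset into a redshift error and into a line-of-sight proper distance. It assumes
pure Hubble flow in the flat $\Omega_m = 0.3$ cosmology adopted in this paper; the function
names are hypothetical.

```python
import math

C_KM_S = 299792.458  # speed of light in km/s


def hubble_rate(z, omega_m=0.3, omega_lambda=0.7):
    """H(z) in units of h km/s/Mpc for a flat universe."""
    return 100.0 * math.sqrt(omega_m * (1.0 + z) ** 3 + omega_lambda)


def redshift_error(z_em, dv_km_s):
    """Redshift offset corresponding to a velocity offset dv at redshift z_em."""
    return (1.0 + z_em) * dv_km_s / C_KM_S


def proper_distance_error(z_em, dv_km_s):
    """Proper distance (h^-1 Mpc) corresponding to dv, assuming pure Hubble flow."""
    return dv_km_s / hubble_rate(z_em)


if __name__ == "__main__":
    print(redshift_error(2.5, 500.0), proper_distance_error(2.5, 500.0))
```

At $z_{\rm em} = 2.5$, an rms of 500 km s$^{-1}$ corresponds to $\Delta z \approx 0.006$ and
to about 1.4 $h^{-1}$ Mpc (proper), a non-negligible fraction of the 10 $h^{-1}$ Mpc region
studied here.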

The QSO luminosity at the Lyman limit is computed from the available
B-magnitude. The QSO continuum slope is assumed to be a
power law, $F_\lambda \propto \lambda^{\alpha}$. We use $\alpha = -0.5$, following Francis (1993). We
checked that within a reasonable range of $\alpha = -0.5$ to $-0.7$ (e.g.
Cristiani & Vio 1990), our main result (i.e. the density structure
around quasars) is not affected by our choice of $\alpha$.

Figure 2. Determination of the emission redshift, $z_{\rm em}$, of three QSOs using
the Mg II line (see Table 1). For each quasar, the Mg II emission lines (λλ
2796.35, 2803.53) are fitted (two dashed lines) on top of a polynomial
continuum (long dashed line). The final profile is shown with a solid line.

All possible metal lines and Lyman-α absorption of a few sub-DLA
systems (there are no DLA systems in the observed spectra) are
flagged inside the Lyman-α forest. The entire line is removed up
to the point where it reaches the continuum. We have not removed
the Lyman-α absorption associated with metal line systems (i.e.
systems with $N({\rm H\,I}) < 10^{19}$ cm$^{-2}$), but the metal absorption lines
themselves are flagged and removed.

Continuum fitting of the quasar spectra is very important for our
analysis. As most of the QSOs in our sample are at lower redshifts
where line-crowding is not a problem, all the available line-free
regions are used to fit the continuum. The procedure used to compute
the continuum has been calibrated and controlled using synthetic
spectra by Aracil et al. (2004). They estimated that errors in
the continuum amount to about 2% at $z \sim 2.3$. The transmission
$F = \exp(-\tau)$ for each quasar in Table 1 is shown in Fig. 1 up to a
luminosity distance of 20 $h^{-1}$ Mpc.



2.2 The mock LP quasar sample 

We use mock spectra generated from hydrodynamical simulations
to illustrate and test the method described below. The
simulated cosmological model has $(\Omega_m, \Omega_\Lambda, h, \Omega_b h^2, \sigma_8) =
(0.3, 0.7, 0.65, 0.019, 0.9)$, where the symbols have their usual
meaning, and we have used CMBFAST (Seljak & Zaldarriaga 1996)
to generate the linear power spectrum at the starting redshift $z = 49$,
assuming scale-invariant $n = 1$ primordial Gaussian fluctuations.
The baryons are heated and ionized by an imposed uniform
ionizing background as computed by Haardt & Madau (1996), and
updated by Haardt & Madau (2001). We have increased the photo-heating
rates during hydrogen and helium reionization to satisfy the
constraints on the temperature of the intergalactic medium as determined
by Schaye et al. (2000). This ionizing background was referred
to as 'designer model' in that paper. In this model, hydrogen
reionizes at $z = 6.5$ and helium at $z = 3.5$. The amplitude of this
background is scaled so that the mock spectra reproduce the evolution
of the mean transmission $\exp(-\tau)$ with redshift. The simulation
is performed with a modified version of HYDRA (Couchman,
Thomas & Pearce 1995) as described in more detail in Theuns
et al. (1998). HYDRA combines Smoothed Particle Hydrodynamics
(SPH, Lucy 1977; Gingold & Monaghan 1977) to represent the gas,
and P3M (Couchman 1991; Hockney & Eastwood 1981) to solve
Newtonian gravity. It follows the evolution of a periodic, cubic region
of the universe of co-moving size 20 $h^{-1}$ Mpc to a redshift
$z = 1.7$, using $256^3$ particles of each species, and a co-moving
gravitational softening of 20 $h^{-1}$ kpc. Non-equilibrium gas cooling
and photo-heating are implemented, using the rates of Theuns et
al. (1998). Cold, dense gas particles are converted to collisionless
stars, but there is no feedback included. The resolution of the simulations
is close to sufficient to resolve the Lyman-α forest.
As the simulation is running, we store the physical state of the IGM
along many thousands of uncorrelated sight lines, which are later
patched together into mock spectra with a large redshift extent.
A full simulated spectrum typically requires around 20 individual
sight lines through the simulation box, at $z = 2$. We use the photoionization
package CLOUDY to compute the ionization balance
of the gas in the optically thin limit, in the presence of the Haardt
& Madau (2001) ionizing background. We generate 20 mock spectra
for each of our observed quasars, taking into account the excess
ionization by a QSO of luminosity similar to the mean luminosity
of the QSOs in our sample. A mock spectrum for a given QSO extends
over the same wavelength range as that QSO and has the same
pixel size and spectral resolution, and we add noise to the simulated
spectra with the same wavelength and flux dependence. Except
for metals, which are flagged in the real data and are not used
in this analysis, this procedure ensures that we impose the same biases
in the reconstruction of the mock spectra as are present in the
real data. The analysis procedure described next does not rely on
simulations: we only use simulated spectra to demonstrate that the
method works.



3 METHOD
3.1 Overview

Our aim is to investigate the density structure around high-redshift
luminous quasars. We do so by investigating how the probability
distribution function (PDF) of optical depths, $P(\tau)$, varies with distance to
the quasar. Far away from the QSO, $P(\tau, z)$ evolves with redshift
mainly because the mean optical depth decreases with decreasing redshift due
to the expansion of the Universe. In the appendix we show that the
shape of the PDF does not evolve much over the relatively small
redshift range $1.8 < z < 3.1$ covered by our QSO sample. Therefore
we can define a redshift-independent scaled optical depth distribution,
$P(\tau, z) = P(\tau(z)/\tau_0(z))$, which allows us to predict
the optical depth PDF at any z. The ability to take into account
the strong redshift evolution of the mean optical depth is a major
advantage of our method.

We can now compare this predicted optical depth PDF with
the measured one, as a function of distance r to a QSO. We show
that this predicted PDF differs significantly from the measured PDF
close to the QSO. Indeed, radiation from the QSO will decrease the
neutral hydrogen fraction in its surroundings, which in turn will
lead to a decrease of the reference optical depth. This is the usual
proximity effect. In contrast, if the QSO lives in a high-density
environment, as is expected, then the optical depth will increase.
Therefore we need to introduce another function $f(r)$, which describes
the effect of the QSO on the PDF, such that the optical depth
scales as $\tau/(f(r)\,\tau_0(z))$. When radiation dominates, $f(r) \ll 1$,
and the optical depth becomes very small. When density dominates,
$f(r) \gg 1$, and the optical depth becomes very large. The explicit
expression for $f(r)$ is given in Eq. (9) below. Of course, the presence
of the QSO might also change the shape of the PDF. Our main
assumption in this paper is that the shape does not change, and we
demonstrate below that this is a good assumption.

By comparing the predicted to the measured optical depth
PDFs, we can determine the relative importance of radiation versus
density enhancement. As we explain in more detail below, we can
offset a higher amplitude of the background ionization rate with a
decrease in the overdensity: our determination is degenerate in this
respect. So instead of assuming no overdensity and inferring the
background ionization rate, $\Gamma(z)$, as is usually done in the analysis
of the proximity effect, we will assume a given value of $\Gamma(z)$, and
recover the corresponding overdensity.

This method is based on comparing optical depth PDFs. We
characterise the difference between two PDFs by computing the
maximum absolute difference between the corresponding cumulative
PDFs. Given bootstrap re-sampled realisations of these PDFs,
we can associate a probability to a given difference in cumulative
PDFs. This then allows us to associate a given probability to the
overdensity as a function of distance to the QSO, for an assumed
value of the ionization rate. This is the basis for the inferred
overdensity as a function of distance to the LP QSOs shown in Fig. 10
below.

In the rest of this section we explain this procedure in more detail,
and test it on our mock QSO spectra. Readers not interested in these
details may want to skip directly to Sect. 5, where we apply the
method to the LP data.

3.2 The optical depth - density relation
We analyse the proximity effect using the cumulative distribution
of pixel optical depths as a function of distance to a quasar. The
starting point is Eq. (1), which relates optical depth, $\tau$, to overdensity,
$\Delta = \rho/\langle\rho\rangle$,



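$$ \tau = \tau_0\,\Delta^{2} \propto \Delta^{1/(1+\beta)}, \qquad (2) $$

where $1/(1 + \beta) = 2 - 0.7(\gamma - 1)$, and

$$ \tau_0 = 0.206\, \frac{1}{\Gamma_{12}} \left(\frac{\Omega_b h^2}{0.02}\right)^{2} \frac{H(z=2)}{H(z)} \left(\frac{1+z}{3}\right)^{6} \frac{X}{0.76}\, \frac{X + 0.5Y}{0.88}\, \frac{\alpha(T)}{\alpha(T_4)} \qquad (3) $$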



is the Gunn-Peterson (Gunn & Peterson 1965) optical depth. Here,
$\alpha(T_4 = 10^4\,{\rm K}) = 4.19 \times 10^{-13}$ cm$^3$ s$^{-1}$ is the hydrogen
recombination coefficient (Verner & Ferland 1996), which scales approximately
$\propto T^{-0.7}$ close to $T = 10^4$ K, $H(z)$ is the Hubble parameter
at redshift z, X and Y are the hydrogen and helium abundances
by mass, respectively, and $\Omega_b h^2$ is the baryon density parameter. We have
assumed that hydrogen and helium are both almost fully ionized.
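
As a numerical sanity check on the normalisation of the Gunn-Peterson optical depth, the
sketch below evaluates $\tau_0$ in Python. It assumes the scaling
$\tau_0 \propto (\Omega_b h^2)^2 (1+z)^6 X (X + 0.5Y)\,\alpha(T) / [\Gamma_{12} H(z)]$ with
reference values $\Omega_b h^2 = 0.02$, $X = 0.76$, $Y = 0.24$ and $z = 2$, together with
$\alpha(T) \propto T^{-0.7}$. The function names and default arguments are hypothetical
choices made for this illustration.

```python
import math


def hubble_ratio(z, omega_m=0.3, omega_lambda=0.7):
    """H(z=2) / H(z) for a flat universe."""
    def e_of_z(zz):
        return math.sqrt(omega_m * (1.0 + zz) ** 3 + omega_lambda)
    return e_of_z(2.0) / e_of_z(z)


def tau0(z, gamma12=1.0, omega_b_h2=0.02, x_h=0.76, y_he=0.24, temperature=1.0e4):
    """Reference optical depth, normalised to 0.206 at z = 2 (assumed form of Eq. 3)."""
    # Recombination coefficient relative to its value at 10^4 K (alpha ~ T^-0.7).
    alpha_ratio = (temperature / 1.0e4) ** (-0.7)
    return (0.206 / gamma12
            * (omega_b_h2 / 0.02) ** 2
            * hubble_ratio(z)
            * ((1.0 + z) / 3.0) ** 6
            * (x_h / 0.76)
            * ((x_h + 0.5 * y_he) / 0.88)
            * alpha_ratio)


def log_slope(z_lo=2.0, z_hi=3.0):
    """Effective exponent a in tau0 ~ (1+z)^a between two redshifts, at fixed Gamma_12 and T."""
    return math.log(tau0(z_hi) / tau0(z_lo)) / math.log((1.0 + z_hi) / (1.0 + z_lo))


if __name__ == "__main__":
    print(tau0(2.0), tau0(3.0), log_slope())
```

Under these assumptions, $\tau_0(z = 2)$ returns 0.206 by construction, and the logarithmic
slope of $\tau_0$ between $z = 2$ and $z = 3$ at fixed $\Gamma_{12}$ and $T$ is about 4.6,
close to the exponent $\approx 4.5$ quoted in the Introduction.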
The exponent $\gamma$ and normalisation $T_0$ of the temperature-density
relation $T = T_0\,\Delta^{\gamma-1}$ have been measured by e.g. Schaye et al.
(2000) to be in the range $1 \le \gamma \le 1.5$ and $T_0 \approx 10^4$ K in the
redshift interval $2 < z < 3$. How are the density and optical depth
PDFs related?

Let $P_\Delta(\Delta, z)\,d\Delta$ be the density distribution at redshift z.
The probability distribution function (PDF) for the optical depth,
$P_\tau(\tau, z)\,d\tau$, is obtained by combining $P_\Delta(\Delta, z)\,d\Delta$ with Eq. (2).
At two different redshifts $z_1$ and $z_2$, say, $P_\tau(\tau, z)\,d\tau$ will differ
because $\tau_0$ changes (see Eq. (3)) and because the density PDF,
$P_\Delta(\Delta, z)\,d\Delta$, evolves as structure grows. For the relatively small
redshift range covered by the LP quasars, we show below that the
redshift evolution of $P_\tau(\tau, z)\,d\tau$ is dominated by that of the mean
optical depth, $\tau_0$, and that the shape of the distribution does not
change very much. This is true for the simulated quasar sample as
well. The PDF of $\tau$ is therefore given by



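$$ P_\tau(\tau, z)\,d\tau \approx (1 + \beta)\, P_\Delta\!\left[\left(\frac{\tau}{\tau_0}\right)^{1+\beta}\right] \left(\frac{\tau}{\tau_0}\right)^{1+\beta} \frac{d\tau}{\tau}, \qquad (4) $$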



and to a very good approximation, its redshift dependence is
through $\tau_0(z)$ only. Therefore, given the PDF of $\tau$ at several redshifts
covered by the LP sample, $1.7 < z < 3.1$, one can accurately
predict the scaling factor required to scale each PDF $P_\tau(\tau, z)$ to the
PDF observed at a given reference redshift, $z = 2.25$. We will call
this the scaled optical depth PDF below. We emphasise here that
the transmission is non-linearly related to the density. Since the
median optical depth corresponds to a value of the flux within the
noise around the continuum, the evolution with redshift cannot be
taken into account with the transmission only.
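
The change of variables behind Eq. (4) can be checked numerically. The Python sketch below
maps a density PDF to an optical depth PDF through $\Delta = (\tau/\tau_0)^{1+\beta}$ and
verifies that the result is normalised. The lognormal density PDF is a toy assumption made
only for this illustration (it is not the fitting function discussed in the Appendix), and
all function names are hypothetical.

```python
import math


def beta_from_gamma(gamma):
    """Solve 1 / (1 + beta) = 2 - 0.7 (gamma - 1) for beta."""
    return 1.0 / (2.0 - 0.7 * (gamma - 1.0)) - 1.0


def lognormal_pdf(delta, sigma=1.0):
    """Toy density PDF with median 1 (an assumption for illustration)."""
    return (math.exp(-0.5 * (math.log(delta) / sigma) ** 2)
            / (delta * sigma * math.sqrt(2.0 * math.pi)))


def tau_pdf(tau, tau0, gamma, density_pdf=lognormal_pdf):
    """Eq. (4): P_tau = (1 + beta) P_Delta(Delta) Delta / tau, Delta = (tau/tau0)^(1+beta)."""
    beta = beta_from_gamma(gamma)
    delta = (tau / tau0) ** (1.0 + beta)
    return (1.0 + beta) * density_pdf(delta) * delta / tau


def integrate_log(func, lo, hi, n=4000):
    """Trapezoidal integral of func over [lo, hi] on a logarithmic grid."""
    xs = [lo * (hi / lo) ** (i / n) for i in range(n + 1)]
    ys = [func(x) for x in xs]
    return sum(0.5 * (ys[i] + ys[i + 1]) * (xs[i + 1] - xs[i]) for i in range(n))


if __name__ == "__main__":
    print(integrate_log(lambda t: tau_pdf(t, 0.2, 1.3), 1e-6, 1e4))
```

Because the toy density PDF has median $\Delta = 1$, half of the probability lies below
$\tau = \tau_0$, which illustrates how a rescaling of $\tau_0$ shifts the whole distribution
without changing its shape.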
Thermal broadening and peculiar velocities prevent the unique
identification of an overdensity, $\Delta$, in real space, with a given optical
depth, $\tau$, in redshift space. Therefore $P_\Delta(\Delta, z)\,d\Delta$ does not
refer to the real space overdensity, but to the optical depth weighted
overdensity, as used for example in Schaye et al. (1999). In the Appendix
we discuss a fitting function of $P_\Delta$ which is based on the
fit introduced by Miralda-Escude, Haehnelt & Rees (2000) for the
density distribution of the IGM. We show there that the shape of
this function fits $P_\Delta$ well, but the best fitting parameters differ
considerably from those of the real space density PDF. We also show that, in
simulations, $P_\Delta$ varies little with redshift in $1.7 < z < 3.1$.
A quasar's proximity effect will change the PDF of $\tau$. The change
due to the increase in ionization rate can be accurately predicted
by the appropriate scaling of $\tau_0$. However, the density PDF may
change, as is expected for biased quasar formation, which will modify
the optical depth PDF accordingly. In our model, the shape of
the density PDF is assumed to be unaltered; only the mean value is
changed. This is our main assumption. Physically, this implies that
feedback effects from the galaxy hosting the QSO, such as winds,
infall, or excess of clustering, that may modify the density distribution
itself, are neglected. The net effect of the quasar is then a
rescaling of $\tau_0$. This scaling factor is determined as a function of
distance r to the quasar, by comparing the measured PDF of $\tau$ at
r with the predicted one at the same redshift. The method is based
on $\tau$, whereas what we observe is the transmission $F = \exp(-\tau)$.
We describe how to infer $\tau$ from F next.



t < T m i n , are lost in the noise, whereas high values of r, r > r max , 
cannot be recovered since the Lyman-a absorption is saturated. 
However, we can estimate the range r m i n < r < r max where r 
can be accurately recovered given the noise properties of the data. 
By using higher-order transitions one can accurately recover high 
values of r where Lyman-a is saturated but Lyman-/3 for exam- 
ple is not (Savage & Sembach 1991; Cowie & Songaila 1998; 
Rollinde, Petitjean & Pichon 2001; Aguirre, Schaye & Theuns 
2002; Aracil et al. 2004). However, here we only use the Lyman-a 
absorption from normalised spectra and recover r between r m i n = 
— log(l — 3a) ~ 0.1 and r max = — log(3a) ~ 2.5, where <t(A) 
is the rms noise as a function of wavelength. Note that r m i n = 0.1 
is a high value compared to the actual noise in most of the spec- 
tra. We use this limit to be conservative. Since we will use the 
cumulative probability distribution of r (CPDF, in the following 
all probability functions implicitly refer to P r , unless explicitly 
noted), we also keep track of the number of pixels below r m i n and 
above r max . The CPDF of this censored representation of the op- 
tical depth, CPDF roc (r), is therefore a portion of the full CPDF, 
CPDF(t) = P(t' < r), between r m i n and r max : 



CPDF rcc (r) 
CPDF rcc (r) 
CPDF rcc (r) 



0. T < Tmin 

CPDF(r) < r < r n 

1- T !> T max 



(5) 



The values of r m i n and r max depend on redshift because the noise 
level o does, but this dependence is very weak for our sample. This 
means that when we scale two recovered PDFs to the same refer- 
ence redshift, the scaled values of r m i n and r max will no longer be 
the same. For example at lower redshift (say, 2 = 2) higher over- 
densities A oc (t/tq(z = 2)) 1+ ^ can be recovered before the line 
becomes saturated than at higher redshift (2 = 3, say) because of 
the evolution of To(z). Conversely, lower over densities can be re- 
covered at 2 = 3 than at z = 2, before the line disappears in the 
noise. This could be exploited to increase the effective recovered 
overdensity range if the evolution of To was strong enough. We de- 
scribe how we scale PDFs to a common redshift next. 

3.4 Scaling of the reference optical depth to (2) 

We show in Sections 4.1 and 5 that the shape of the censored opti- 
cal depth cumulative distribution function in both simulations and 
observations, is nearly independent of redshift. These distributions 
refer to regions far away from the quasar (proper distance > 50 
Mpc//t) where the distribution of r is not modified by radiation 
from the QSO itself. The fact that the shape of the PDF is conserved 
means that redshift evolution can be modeled accurately by a sim- 
ple redshift dependence of the reference optical depth, To(z). We 
find the best fitting scaling tq(z) oc (1 + z) a by minimising the 
maximum absolute distance between scaled optical depth CPDFs 
(KS distance) within different bins in redshift. Note that the evolu- 
tion of the number of systems within a range of column densities, 
as used in most previous work on the proximity effect, is also de- 
scribed as a simple scaling. Errors in To (2) are estimated using a 
bootstrap resampling of chunks of proper size 10 h~ 1 Mpc. In the 
next steps, 10(2) is used to scale the optical depth of each pixel to 
a reference redshift of 2 = 2.25. 



3.3 The optical depth distribution 

At a given redshift only part of the PDF of optical depth, $P_\tau(\tau)\,d\tau$,
can be recovered from the observational data. Low values of $\tau$,



3.5 The proximity effect 

We now consider the influence of a quasar on the optical depth
distribution in the nearby IGM, scaled to the same reference redshift
using the function $\tau_0(z)$. We consider the effect of both the
ionizing flux emitted by the quasar and that of a modified density
distribution.

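Let the quasar emit ionizing photons with a spectrum characterised
in the usual way as

$$ 4\pi J_L(\nu, r) = \frac{L}{4\pi r^2} \left(\frac{\nu}{\nu_{\rm HI}}\right)^{-\phi} \ {\rm erg\,s^{-1}\,cm^{-2}\,Hz^{-1}} = 4\pi J_{21}(z) \times 10^{-21} \left(\frac{\nu}{\nu_{\rm HI}}\right)^{-\phi} \omega(r,z). \qquad (6) $$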

Here, $L$ is the monochromatic luminosity of the quasar at the hydrogen
ionization threshold $\nu_{\rm HI}$. The corresponding ionization rate
is $(12.6/(3+\phi))\, J_{21}\, \omega(r) \times 10^{-12}$ s$^{-1}$, when one approximates the
hydrogen photo-ionization cross-section with a power-law (Theuns
et al. 1998, Table B4). The function $\omega$ is

$$ \omega(r,z) \equiv \frac{L}{\left(4\pi\, r(z)\,[h^{-1}\,{\rm cm}]\right)^2\, 10^{-21}\, J_{21}(z)} \equiv \left(\frac{r_L(z)}{r}\right)^2. \qquad (7) $$



Here $r(z)$ is the luminosity distance from the quasar at redshift
$z_{\rm em}$ to the cloud at redshift $z$, at the time the photons arrive there.
For a given pixel, $r$ is computed from the absorption wavelength of
that pixel and the emission redshift of the quasar, using the equations
from Phillipps, Horleston & White (2002) for an $\Omega_m = 0.3$
flat cosmological model (in the future, it would be worthwhile to
investigate how our results depend on the assumed cosmology, as
initiated for the standard proximity effect analysis by Phillipps et
al. 2002). Note that this neglects possible infall or outflow close to
the quasar. All distances are computed as luminosity distances in
the analysis. We may nevertheless also quote them as proper distances, since
proper and luminosity distances are almost equal up to 30 $h^{-1}$ Mpc
at the redshifts of interest here. All quasars in our sample have a
similar luminosity (Table 1); they will therefore have a similar value of
$r_L$ provided $\Gamma_{12}$ does not vary strongly, as is expected (e.g. Haardt &
Madau 1996; Fardal et al. 1998).
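
As a rough numerical illustration, the characteristic distance $r_L$ can be evaluated if Eq. (7) is taken to read $\omega = L / [(4\pi r)^2\, 10^{-21} J_{21}] = (r_L/r)^2$, with $L$ in $h^{-2}$ erg s$^{-1}$ Hz$^{-1}$ and $r$ in $h^{-1}$ cm. The sketch below rests on that reading; the function names and the Mpc-to-cm conversion constant are assumptions introduced here, and the result is expressed in terms of $J_{21}$ rather than $\Gamma_{12}$.

```python
import math

CM_PER_MPC = 3.0857e24  # centimetres in one megaparsec (assumed conversion)


def r_L_mpc(L, J21):
    """Distance in h^-1 Mpc at which omega = 1.

    L   : monochromatic luminosity at the Lyman limit, h^-2 erg s^-1 Hz^-1
    J21 : background intensity in units of 1e-21 erg s^-1 cm^-2 Hz^-1 sr^-1
    """
    r_cm = math.sqrt(L / (1e-21 * J21)) / (4.0 * math.pi)
    return r_cm / CM_PER_MPC


def omega(r_mpc, L, J21):
    """Ratio of quasar to background ionizing flux at distance r (h^-1 Mpc)."""
    return (r_L_mpc(L, J21) / r_mpc) ** 2
```

Because $\omega \propto 1/r^2$, doubling the distance divides the quasar contribution by four, and a brighter background (larger `J21`) moves $r_L$ closer to the quasar.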

The total ionization rate $\Gamma$ in the IGM is the sum of that from the
uniform background radiation, $\Gamma^{\rm IGM}(z)$, and that from the radiation
of the quasar, $\Gamma^{Q}(r, z)$. The increase in $\Gamma$ will shift the PDF of
$\tau$ to smaller values, without changing its shape. Very close to the
QSO, $\Gamma(r) \propto 1/r^2$ diverges, hence according to Eq. (3), $\tau_0 \to 0$,
which is the usual proximity effect.

However, we argued before that the quasar is likely to be in an
overdense region, which will lead to an increase in $\tau$. We model
this by assuming that the density close to the quasar is simply a
scaled-up version of that far away from the quasar, i.e.

$$ P_\Delta\left(r, (1+\Psi(r))\Delta\right) d\Delta = P_\Delta(\Delta)\, d\Delta. \qquad (8) $$

Eq. (2) shows that this has the effect of increasing $\tau_0$ by a factor
$(1+\Psi(r))^{1/(1+\beta)}$, shifting the PDF of $\tau$ at a given $r$ bin to higher
values without changing its shape. To simplify the notation, we will
use $\rho/\langle\rho\rangle$ to refer to the density structure, or enhancement, around
the quasar (i.e. $1+\Psi$), and $\Delta$ to refer to the distribution of density,
$P_\Delta$.

Note that we neglect a possible variation of temperature due to the 
ionizing flux from the quasar. Since the main modification to the 
ionizing background is the larger proportion of hard photons from 
the quasar, we assume that the change in temperature is not large 
enough to modify the optical depth distribution in a significant way. 
This argument will not be valid if He II is not ionized. Available
observations indicate that the epoch of He II reionization is
probably earlier than $z \sim 3$ (e.g. Theuns et al. 2002).



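The combined effect of a density increase and extra ionizing photons
is to shift $\tau_0$ by a factor

$$ \tau_0 \rightarrow \tau_0\, \frac{\left(\rho(r)/\langle\rho\rangle\right)^{1/(1+\beta)}}{1 + (r_L/r)^2}. \qquad (9) $$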



The relative importance of quasar versus UV-background ionizing
photons is characterised by $r_L(z)^2 \propto L/\Gamma_{12}(z)$ (where $\Gamma^{\rm IGM} =
\Gamma_{12} \times 10^{-12}$ s$^{-1}$). In the absence of any temperature enhancement the
optical depth at $r$ is globally scaled compared to the optical depth in
the intergalactic medium. As a consequence, the distribution $P(\tau)$
is simply shifted along the abscissa toward higher values in case of
an overdensity ($\rho/\langle\rho\rangle > 1$) or lower values under the influence
of the quasar ionizing flux ($\omega > 0$). Thus, for a given distance $r$,
there is an intrinsic degeneracy between the local density structure
$\rho(r)$ and the value of $\Gamma_{12}$, combined in the above scaling factor.
Therefore, if one modifies the value of $\Gamma_{12}$, the recovered value
of $(\rho(r)/\langle\rho\rangle)^{1/(1+\beta)}$ is scaled by a constant factor $1/\Gamma_{12}$ when
$r \ll r_L$ and is independent of $\Gamma_{12}$ when $r \gg r_L$.
Close to or far away from the quasar, this scaling is constant, which
allows the shape of the density enhancement to be recovered. Thus,
although the absolute value of $\rho(r)$ for $r \ll r_L$ will
depend on the value of $\Gamma_{12}$ assumed in the analysis, the presence
of a non-uniform density enhancement can in principle be revealed
by this method. Conversely, if the underlying density enhancement
is known, e.g. through numerical simulations, or if it is neglected
as in the standard proximity effect analysis, $\Gamma_{12}$ can be recovered.
However, neglecting overdensities always implies an overestimate
of $\Gamma_{12}$, irrespective of the method.
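
The scaling factor of Eq. (9) is simple enough to evaluate directly. The sketch below is a hedged illustration: it assumes the factor has the form $(\rho/\langle\rho\rangle)^{1/(1+\beta)} / (1 + (r_L/r)^2)$ and takes $1/(1+\beta) = 2 - 0.7(\gamma - 1)$, as implied by the exponent quoted in Section 5; the function names are hypothetical.

```python
def density_exponent(gamma=1.5):
    """Exponent 1/(1+beta) = 2 - 0.7*(gamma - 1) linking tau to density."""
    return 2.0 - 0.7 * (gamma - 1.0)


def tau0_shift(r, r_L, overdensity=1.0, gamma=1.5):
    """Multiplicative change of tau0 at distance r from the quasar.

    r, r_L      : distances in the same units (e.g. h^-1 Mpc)
    overdensity : rho(r)/<rho>; 1.0 means a uniform density field
    """
    omega = (r_L / r) ** 2  # quasar-to-background ionizing flux ratio
    return overdensity ** density_exponent(gamma) / (1.0 + omega)
```

For a uniform density the factor is exactly 1/2 at $r = r_L$ and tends to 1 far from the quasar, while an overdensity pushes it back up; this competition is the degeneracy discussed above.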

We now describe how the density structure is recovered and how 
errors are estimated. 



3.6 Estimation of the density structure and errors 

The density structure, $\rho(r)/\langle\rho\rangle$, can be inferred once the ionizing
rate, $\Gamma_{12}(z)$, and the slope of the temperature-density relation, $\gamma$,
are determined. We will illustrate how $\rho/\langle\rho\rangle$ changes with changes
in these parameters.

The mean scaled CPDF in the IGM, and its statistical uncertainty,
are determined from bootstrap resampling pixels outside of
the possible proximity region, at distances larger than 50 $h^{-1}$ Mpc
proper. We characterise the difference between two PDFs by the
maximum absolute distance (KS distance) between the corresponding
cumulative distributions, just as in a Kolmogorov-Smirnov test.
Bootstrap resampling allows us to associate a probability to a given
value of this KS distance, $\mathcal{P}(KS)$.

The proximity region is characterised by evaluating the scaled
CPDF in radial bins from the background QSO. For each radial bin,
the mean CPDF in the IGM is shifted according to Eq. (9), using
our assumed value of $\Gamma_{12}$ and different values of the function
$(\rho(r)/\langle\rho\rangle)^{1/(1+\beta)}$. Given the probability associated with a given
value of KS, we can determine a probability associated with a given
value of $\rho(r)$, $P_{KS}(\rho(r)/\langle\rho\rangle)$. The distribution of KS values of
course depends on the number of pixels in each bin. Since we want
to use small bins close to the QSO, we need to determine the probability
$\mathcal{P}(KS)$ for each bin separately, using only pixels outside the
proximity region.
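
One way to picture how a probability is attached to a KS distance for a bin of a given size is sketched below. It is a simplified illustration rather than the procedure of the paper: pixels are resampled individually instead of in chunks, the probability is defined here as the fraction of bootstrap KS values at least as large as the observed one, and all function names are hypothetical.

```python
import numpy as np


def censored(tau, tau_min, tau_max):
    """Clip optical depths to the recoverable range [tau_min, tau_max]."""
    return np.clip(np.asarray(tau, dtype=float), tau_min, tau_max)


def ks_to_reference(sample, reference_sorted):
    """KS distance between a sample and a sorted reference sample."""
    sample = np.sort(sample)
    grid = np.concatenate([sample, reference_sorted])
    cdf_s = np.searchsorted(sample, grid, side="right") / sample.size
    cdf_r = np.searchsorted(reference_sorted, grid, side="right") / reference_sorted.size
    return float(np.max(np.abs(cdf_s - cdf_r)))


def ks_null_distribution(igm_tau, n_pixels, n_boot=500, seed=0):
    """KS distances of bootstrap draws of n_pixels IGM pixels against the
    full IGM sample; n_pixels should match the size of the radial bin."""
    rng = np.random.default_rng(seed)
    reference = np.sort(igm_tau)
    draws = (rng.choice(igm_tau, size=n_pixels, replace=True) for _ in range(n_boot))
    return np.array([ks_to_reference(d, reference) for d in draws])


def ks_probability(ks_value, null_ks):
    """Fraction of bootstrap KS distances that reach or exceed ks_value."""
    return float(np.mean(null_ks >= ks_value))
```

Since the null distribution broadens as `n_pixels` decreases, it has to be rebuilt for every radial bin, which mirrors the remark above about small bins close to the QSO.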

Figure 3. Evolution with redshift of the cumulative distribution of optical
depth, computed with mock LP spectra. Only pixels located far away from
the proximity region are considered. Top panel: Cumulative distributions in
five redshift bins centred on $z$ = 1.8, 2.0, 2.25, 2.5 and 2.95 (left to right,
with alternate solid and dashed lines). The 3$\sigma$ statistical error is shown with
a vertical mark in both panels. Bottom panel: Same distributions but after
scaling each curve to the CPDF at $z = 2.25$ (thick curve in both panels)
by a redshift dependent scaling factor, $\tau \to \tau\,\tau_0(z=2.25)/\tau_0(z)$. The
scaled curves are all consistent within the 3$\sigma$ error, showing that the shape
of the distribution is independent of $z$.

Figure 4. Evolution of different percentiles of the scaled optical depth
($z = 2.25$) with luminosity distance to the background quasar, in mock LP
spectra. Quasars are randomly located in the simulation box, which implies
that no additional density structure around the quasars is expected
statistically (i.e. $\rho/\langle\rho\rangle = 1$). Mock spectra are computed assuming the mean
luminosity of the LP sample, $L = 5.4 \times 10^{31}\,h^{-2}$ erg s$^{-1}$ Hz$^{-1}$, and $\Gamma_{12} = 1$.
The distance where the amplitudes of the ionizing flux from the quasar and
in the IGM are equal (i.e. $\omega = 1$, Eq. 7) is indicated by the vertical dashed
line. Horizontal lines indicate the observational upper and lower limits in
optical depth.

We bootstrap the QSO sample, using different sub-samples of
six quasars taken from the 12 quasars available in the full sample. We
can then define a global probability associated to $\rho(r)$ as

$$ P(\rho(r)/\langle\rho\rangle) = \left\langle P_{KS}(\rho(r)/\langle\rho\rangle) \right\rangle_{\rm sub-sample}, \qquad (10) $$

which will allow us to characterise the density structure at different
levels of confidence.
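
The average over quasar sub-samples in Eq. (10) can be sketched as follows. This is a minimal illustration under assumptions: sub-samples are drawn at random without replacement, and `p_ks` is a hypothetical user-supplied callable that returns $P_{KS}$ on a grid of trial $\rho/\langle\rho\rangle$ values for the quasars it is given.

```python
import numpy as np


def global_probability(p_ks, n_quasars=12, subsample_size=6,
                       n_subsamples=200, seed=0):
    """Average P_KS over random quasar sub-samples.

    p_ks(members) must return an array of probabilities, one per trial
    value of rho/<rho>, computed from the quasars listed in `members`.
    """
    rng = np.random.default_rng(seed)
    total = None
    for _ in range(n_subsamples):
        # indices of the quasars kept in this sub-sample (no repeats)
        members = rng.choice(n_quasars, size=subsample_size, replace=False)
        p = np.asarray(p_ks(members), dtype=float)
        total = p if total is None else total + p
    return total / n_subsamples
```

Confidence regions at a given radius then follow from the values of $\rho/\langle\rho\rangle$ whose averaged probability exceeds the chosen threshold.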

Note that this method is also able to recover $\Gamma_{12}$, if one assumes
$\rho(r) = \langle\rho\rangle$, i.e. the assumption made in the standard analysis
of the proximity effect. Indeed, the above procedure can be
repeated for different values of $\Gamma_{12}$, while maximising the product of
$P(\rho(r) = \langle\rho\rangle)$ over $r$.

We will first apply the method to mock spectra in order to show 
that this method works well. We also use the simulations to show 
that our method of bootstrap sampling chunks and quasars gives 
realistic errors. 



4 PROXIMITY EFFECT USING OPTICAL DEPTH : 
VALIDATION OF THE METHOD WITH SYNTHETIC 
SPECTRA 

In this section, we use mock LP spectra, generated as described
in Section 2.2. The proximity effect is implemented as described
by Eq. (9), assuming the mean luminosity of the LP sample, $L =
5.4 \times 10^{31}\,h^{-2}$ erg s$^{-1}$ Hz$^{-1}$, and $\Gamma_{12} = 1$, without and with an additional
density enhancement. Note that the value used for $\Gamma_{12}$ here
need not be equal to the value actually implemented in the simulation
itself. The different steps involved in the analysis, as described
above, are now applied successively to the mock spectra.
Our assumptions and the ability of the method to recover the density
structure will be discussed.

Figure 5. Recovered density structure versus luminosity distance to the
background quasar, from the analysis of mock LP spectra. Mock spectra,
including additional ionization from the background quasar, are generated
from randomly positioned quasars (i.e. $\rho_{\rm input}/\langle\rho\rangle = 1$). The 2 and
3$\sigma$ confidence levels are indicated as the blue region and solid lines
respectively. The input structure is well within the 2$\sigma$ confidence level except
for a small bias, and a large increase of errors, below $\sim 10\,h^{-1}$ Mpc
proper (luminosity and proper distances are similar up to 30 $h^{-1}$ Mpc),
which are explained by the modifications of the CPDF due to the noise
(see text for details). The luminosity of the quasars and the parameters $\Gamma_{12}$
and $\gamma$ are identical in the analysis and the generation of mock spectra (i.e.
$L = 5.4 \times 10^{31}\,h^{-2}$ erg s$^{-1}$ Hz$^{-1}$; $\Gamma_{12} = 1$; $\gamma = 1.5$).



4.1 Evolution of the optical depth with redshift 

Since we are first interested in the evolution of the optical depth
in the IGM, we consider here only pixels at a distance larger than 50
$h^{-1}$ Mpc proper from the quasar. The evolution of the CPDF
within five bins in redshift centred at $z$ = 1.8, 2.0, 2.25, 2.5 and
2.95 is displayed in the top panel of Fig. 3. The main evolution
is driven by the mean density, which increases with redshift together
with the mean optical depth $\tau_0(z)$. This corresponds to
a shift of the CPDF along the abscissa toward higher values. As
explained in Section 3.4, a simple scaling of the reference optical
depth $\tau_0(z)$ is used to remove this primary evolution. Parameterising
$\tau_0(z) \propto (1+z)^{\alpha}$ gives a best fitting value of $\alpha \approx 4.5$.
Although some scatter is present, half of 50 different samples prefer
a value $4 < \alpha < 4.5$. Once the optical depth at each pixel is
scaled using this relation, the CPDFs computed within the same bins
are displayed in the bottom panel of Fig. 3. We then find that the
shape is indeed conserved, to the level of accuracy of our sample.
In our mock samples, the ionizing background $\Gamma(z)$ varies only
weakly with $z$ over the range $1.7 < z < 3.1$, as does the temperature
$T$ of the IGM. Therefore a scaling close to $\alpha = 4.5$ is
indeed expected from Eq. (3), given the high redshift approximation
$H(z) \propto (1+z)^{3/2}$. Below we will generate several observed
data sets by bootstrapping the LP quasars, and use either the best
fitting exponent in $\tau_0(z) \propto (1+z)^{\alpha}$ for each sample, or a fixed
value of $\alpha = 4.5$.
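
As a short consistency check, and assuming that Eq. (3) has the usual fluctuating Gunn-Peterson form (an assumption made here, since that equation is not reproduced in this section), the expected exponent follows directly when $\Gamma$ and $T$ are taken as constant:

$$ \tau_0(z) \propto \frac{(1+z)^6\, T^{-0.7}}{\Gamma(z)\, H(z)} \quad\Longrightarrow\quad \tau_0(z) \propto \frac{(1+z)^6}{(1+z)^{3/2}} = (1+z)^{4.5}. $$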



4.2 Proximity effect 

Once the main evolution of optical depth with redshift is removed,
we can concentrate on its change with distance to the quasar. Fig. 4
shows the evolution of different percentiles of the optical depth
with luminosity distance to the mock background quasar. Note that
we only model the excess ionizing radiation from the QSO: there
is no overdensity at the emission redshift (i.e. $\rho/\langle\rho\rangle = 1$). We
note that the relation between $\omega$ and distance, Eq. (7), depends on
the luminosity of the quasar. In our homogeneous sample, the luminosity
of the QSOs, and hence $\omega$, varies only within a factor of
two from one quasar to another. For the mock spectra, since we assume
a unique value of the luminosity for all quasars, the distance
at which $\omega = 1$ is the same for all mock spectra: it is shown as a
vertical line in the figure. The effect of assuming a different luminosity
on the recovered overdensity is discussed in more detail in
Section 5.

Fig. 4 clearly reveals the decrease of $\tau$ with decreasing radius, as
the mock QSO starts dominating the ionization rate. Since in this
case $\rho/\langle\rho\rangle = 1$, the optical depth where $\omega = 1$ must be a factor
of two less than its value in the ambient IGM at $r > 50\,h^{-1}$ Mpc
(Eq. 9). This is indeed observed here, for each percentile. Note
how at small distances the optical depth is everywhere decreased
below $\tau_{\min}$, and how the different percentiles are almost all equal
to the minimum optical depth.
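
The percentile curves discussed above can be computed with a simple binning of pixels by distance. The sketch below is illustrative only: the function name and the default percentiles (taken from the values quoted for the observed sample) are assumptions, and the censoring limits are not applied.

```python
import numpy as np


def percentiles_by_radius(tau, r, bin_edges, percentiles=(70, 85, 95)):
    """Percentiles of tau in each radial bin; NaN rows mark empty bins.

    tau       : scaled optical depth of every pixel
    r         : distance of every pixel to the quasar
    bin_edges : increasing bin boundaries, same units as r
    """
    tau = np.asarray(tau, dtype=float)
    r = np.asarray(r, dtype=float)
    index = np.digitize(r, bin_edges) - 1  # bin number of every pixel
    out = np.full((len(bin_edges) - 1, len(percentiles)), np.nan)
    for i in range(len(bin_edges) - 1):
        in_bin = tau[index == i]
        if in_bin.size:
            out[i] = np.percentile(in_bin, percentiles)
    return out
```

If the shape of the distribution is conserved, every column of the returned array should change by roughly the same factor from one radial bin to the next.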



4.3 Recovery of a uniform density field 

This qualitative change with distance is now studied quantitatively
to recover the underlying density field close to the background
quasars. During the implementation of the proximity effect in the
mock spectra, we assumed $\Gamma_{12} = 1$. Therefore, we shall use the
same value in the analysis. A wrong estimate of $\Gamma_{12}$ mostly leads
to a re-scaling of $\rho/\langle\rho\rangle$ in the region of interest, close to the quasar.
Although the simulation does not correspond to a unique value of $\gamma$
(there is a dispersion in the temperature-density relation), the exact
assumed value, if within the range specified above (Eq. 2), does not
have a large influence on the recovered density; we assume here
$\gamma = 1.5$. We will illustrate the amplitude of these effects on the
analysis of the Large Programme quasars in Section 5. Here, since
the quasars are randomly distributed in the simulation box, we must
recover a uniform density with $\rho(r) = \langle\rho\rangle$.

For each bootstrap sample (Section 3.6), we recover a different
function $\tau_0(z)$ for the evolution of $\tau$. However, very similar results
are obtained using a fixed evolution $(1+z)^{4.5}$, which shows that errors
on the estimation of $\tau_0(z)$ are not essential in the analysis. We
then fit the change of the CPDF with distance to the quasar (Fig. 4)
using Eq. (9). This allows us to recover a probability distribution of
$\rho(r)/\langle\rho\rangle$, from the function $P_{KS}$ (Eq. 10).

Our result is therefore expressed in terms of a probability for each
value of $\rho/\langle\rho\rangle$ at a given radius. Different levels of probability are
shown in Fig. 5. The 2 and 3$\sigma$ levels of confidence correspond to
the blue region and to the solid lines respectively. The input structure
$\rho/\langle\rho\rangle = 1$ is indeed accurately recovered at the 2$\sigma$ level for
$r > 1\,h^{-1}$ Mpc. In this particular case, the assumption of the standard
proximity effect is satisfied (see Introduction). Then, assuming
$\rho/\langle\rho\rangle = 1$, the data (i.e. the optical depth CPDF in our analysis,
but also the mean flux$^2$) are fitted with $\Gamma = \Gamma_{\rm true}$ within the 3$\sigma$
confidence level. Therefore, the real value of $\Gamma_{12}$ may be recovered
if the density field is uniform.

Figure 6. Recovered density structure versus luminosity distance from the
analysis of mock LP spectra with an additional input density structure close
to the quasar, $(\delta\rho)_{\rm input}$. We assume $L = 5.4 \times 10^{31}\,h^{-2}$ erg s$^{-1}$ Hz$^{-1}$,
$\Gamma_{12} = 1$ and $\gamma = 1.5$. The 2 and 3$\sigma$ confidence levels are indicated as the blue
region and solid lines respectively. The 3$\sigma$ confidence level is also indicated
for a sample twice as large as the Large Programme sample (dashed lines).

However, at distances lower than 3 $h^{-1}$ Mpc, a tendency towards
overdensity together with a symmetric increase of errors is apparent.
The reason is the following. When the ionizing flux from the
quasar is high (close to the quasar), the optical depth in most of
the pixels is below $\tau_{\min}$ (see Fig. 4). Then, the modeled (censored)
cumulative function (computed from the CPDF in the IGM) is everywhere
equal to 1. As for the CPDF measured directly in the
spectra, there will always be a fraction of the pixels above $\tau_{\min}$ due
to the noise (this fraction mostly depends on the signal-to-noise ratio).
Therefore, the KS distance between theoretical and measured
CPDFs will have a maximum probability at a value larger than 0.
This is not the case far away from the quasar, where the theoretical
CPDF, for the best fitting value of $\rho/\langle\rho\rangle$, is the mean of all
measured CPDFs. Although most of this effect is included in the
function $P_{KS}(\rho/\langle\rho\rangle)$, this asymmetry will favour a value of $\rho/\langle\rho\rangle$
higher than 1. Besides, a lower $\rho/\langle\rho\rangle$, that is a larger underdensity,
will not modify the theoretical CPDF, as long as $\tau$ is everywhere
lower than $\tau_{\min}$. This explains the large error toward low $\rho/\langle\rho\rangle$ for
$r < 10\,h^{-1}$ Mpc.

Figure 7. Validation of the estimation of errors in the recovered density
structure from mock spectra, with an additional density enhancement
(Fig. 6). The probability distribution of $\rho/\langle\rho\rangle$ obtained at different radii
with one sample is shown as a histogram in the different panels. The
corresponding radius (luminosity distance in $h^{-1}$ Mpc) is indicated and
increases from left to right and bottom to top. The range of most probable
values of $\rho/\langle\rho\rangle$ obtained from 50 different samples is indicated as a
horizontal line. Each estimate of the most probable value lies within the
3$\sigma$ rejection level (vertical dotted lines).



4.4 Recovery of a density structure 

The issue at small distances discussed above should be less important
if an overdensity is present close to the quasar. Indeed, $\tau$
will then remain above $\tau_{\min}$ at lower distances. We have checked
this effect by adding a unique density structure (directly to $\tau$,
so in velocity space) in all spectra with the shape $\rho(r)/\langle\rho\rangle =
1 + 3\exp(-(\log(r))^2/0.6)$ (Eq. (8)). We will show in Section 5
that, using this specific structure, the observed evolution of optical
depth percentiles is well fitted by the evolution in mock LP spectra
(Fig. 9). This input structure is indicated with a solid line in Fig. 6.
The 2 and 3$\sigma$ confidence levels for the recovered density structure
are shown in Fig. 6. It is again consistent with the input structure.
As an exercise, the analysis has been repeated with twice as
many quasars (i.e. 24). The corresponding contours of the 3$\sigma$ rejection
level are shown with dashed lines in Fig. 6. The constraint
is more stringent and still in agreement with the input structure.
As expected, the bias is not present anymore. Although this result
is encouraging, one must remember that the same luminosity and
density structure are used for all quasars, which would obviously
not be the case in a real and larger sample.
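
The input density structure quoted above is easy to evaluate numerically. The sketch below assumes that the logarithm is base 10 and that $r$ is expressed in $h^{-1}$ Mpc (neither is stated explicitly in this section), and it applies the enhancement to $\tau$ through the exponent $2 - 0.7(\gamma - 1)$ quoted in Section 5; the function names are hypothetical.

```python
import numpy as np


def input_density_profile(r_mpc):
    """rho(r)/<rho> = 1 + 3 exp(-(log10 r)^2 / 0.6), with r in h^-1 Mpc."""
    r = np.asarray(r_mpc, dtype=float)
    return 1.0 + 3.0 * np.exp(-np.log10(r) ** 2 / 0.6)


def apply_density_to_tau(tau, r_mpc, gamma=1.5):
    """Scale tau by (rho/<rho>)**(2 - 0.7*(gamma - 1)) at distance r."""
    exponent = 2.0 - 0.7 * (gamma - 1.0)
    return np.asarray(tau, dtype=float) * input_density_profile(r_mpc) ** exponent
```

With these assumptions the profile peaks at an overdensity of 4 at $r = 1\,h^{-1}$ Mpc and falls back towards 1 at both smaller and larger distances.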



2 If the distribution of $\tau$ is known between $\tau_{\min}$ and $\tau_{\max}$, then the
distribution of the flux is known between 0 and 1, which allows us to compute
the mean flux too.



We have shown in two different cases, with a uniform and with an
enhanced density, that our analysis does recover the input structure.
We now concentrate on the estimation of errors.

Figure 8. Evolution with redshift of the CPDF of the optical depth, from
Large Programme spectra. Notations are the same as in Fig. 3. Top panel:
Cumulative distributions in five redshift bins centred on $z$ = 1.8, 2.0,
2.25, 2.5 and 2.95 (left to right, with alternate solid and dashed lines).
Bottom panel: Same distributions but after scaling each curve to the CPDF at
$z = 2.25$ (thick curve in both panels) by a redshift dependent scaling factor,
$\tau \to \tau/\tau_0(z)$.



4.5 Validation of error estimates 

The analysis of one sample (of similar properties as the Large Programme
sample) provides us with a probability distribution for the
recovered density structure. To validate the estimation of errors,
we generate and analyse 50 different samples of mock spectra. In
Fig. 7, the results at different radii are reproduced in each panel.
For each radius, the range of most probable values of $\rho/\langle\rho\rangle$ obtained
for each sample is indicated by a thin horizontal line, while
a specific probability distribution corresponding to one sample is
shown. This procedure is done in the case of an additional density
enhancement (Fig. 6). The best fitting value from different realisations
always falls within the 3$\sigma$ rejection level estimated from
a single sample. The same validation has been done without additional
density structure. The conclusion is the same, although the
bias discussed above implies that the distribution at low radius is
extended toward lower values while the best fitting value is shifted
toward higher values.

Figure 9. Evolution of different percentiles (95%, 85% and 70%) of the
distribution of scaled optical depth ($z = 2.25$) with luminosity distance
to the background quasar. The 1$\sigma$ statistical contours corresponding to the
Large Programme spectra (with bootstrap resampling) are represented with
blue regions in each panel. The position where $\omega = 1$ (dashed line) is
computed assuming $\Gamma_{12} = 1$ in panels (a) and (b), and $\Gamma_{12} = 3$ in panel (c).
For comparison, the mean evolution of the same percentiles in mock LP
spectra is shown (with solid lines and circles) assuming either $\Gamma_{12} = 1$ and
$\rho/\langle\rho\rangle = 1$ (panel a, from Fig. 4); $\Gamma_{12} = 1$ and the input density structure
shown in Fig. 6 (panel b); or $\Gamma_{12} = 3$ and $\rho/\langle\rho\rangle = 1$ (panel c). A larger
ionization rate or a density enhancement is required to reproduce the
observations. These two cases cannot be distinguished within our analysis.

Our analysis has been successfully tested with a numerical sim- 
ulation, for the most probable result as well as the estimation of 
errors. We may now turn to the analysis of the ESO-VLT Large 
Programme. 



5 APPLICATION TO THE ESO-VLT LARGE 
PROGRAMME 

In this section, we apply to the LP quasars the same sequence
of analysis steps described above. First, we have confirmed that the
evolution of the mean transmission $\langle F\rangle$ with $z$ is consistent with previous
determinations (e.g. Press, Rybicki & Schneider 1993; Schaye
et al. 2003). In particular, this gives confidence in the continuum
fitting procedure.

Figure 10. Recovered density structure versus luminosity distance to the
background quasar from the analysis of the proximity effect in the Large
Programme sample. The density structure, $\rho(r)/\langle\rho\rangle$, is recovered within
different bins in distance to the background quasar, from the evolution of
the optical depth distribution in the vicinity of the quasar, as compared to
the distribution in the IGM. Since the optical depth is a function of the
density, the temperature and the amplitude of the ionizing flux, the resulting
density structure depends on the slope of the temperature-density relation,
$\gamma$, and on the amplitude of the background ionizing flux (defined by the
parameter $\Gamma_{12}$). The value of $\gamma$ is fixed to 1.5 since uncertainties in it are
small enough to have little influence on the result. The 2$\sigma$ contours of the
recovered density structure are shown for $\Gamma_{12} = 0.3$ (panel a), $\Gamma_{12} = 1$ (panel
b) and $\Gamma_{12} = 3$ (panel c). A value of $\Gamma_{12} \sim 1$ is favored to recover the
density enhancement derived around the most massive halo at redshift $z = 2$
in the Millennium simulation (Springel et al. 2005), which is overplotted
in each panel with diamonds.

Then, we compute the evolution of optical depth with redshift, displayed
in Fig. 8 (upper panel). It is stronger than in the mock spectra
and seems to favor $\alpha \simeq 6$, when fitted with $\tau_0(z) \propto (1+z)^{\alpha}$.
However, a slope of 4.5 is allowed within a 3$\sigma$ confidence level.
More importantly, our results are not modified, within the statistical
errors, whether we use $\alpha = 4.5$ or the actual fit. The CPDF of the
scaled optical depth is shown in Fig. 8 (bottom panel) with the best
fitting result for $\tau_0(z)$. Observations are also consistent with the
assumption that the shape of the CPDF does not evolve from $z = 3.2$
to $z = 2.2$.

Once the evolution with redshift is removed, the scaled optical
depth CPDFs are computed within different bins in distance to the
quasar. The 1$\sigma$ statistical contours (from a bootstrap resampling)
of the evolution of different percentiles are shown in Fig. 9 (grey
regions in each panel). We note here that the different percentiles
are scaled roughly by the same amount at any given radius (when
the lowest contour of $\tau$ is larger than the minimum value, i.e. the
lower dotted line). This corresponds to the fact that the shape of
the CPDF is conserved when one gets closer to the quasar (at the
level of accuracy of our sample). This gives confidence in our main
assumption that a simple scaling of the reference optical depth is
sufficient.

In order to recover the density structure, values of $\Gamma_{12}$ and $\gamma$ have
to be fixed first. As mentioned earlier, the expected value of $\gamma$ is
between 1 and 1.5 and we use $\gamma = 1.5$ in most of our analysis. The
value of $\Gamma_{12}$ is between 0.3 and 3 (aside from measurements from
standard proximity effect analysis); we use $\Gamma_{12} = 1$. In the previous
section, the mean evolution of optical depth percentiles was computed
in mock LP spectra without additional density structure and
using $\Gamma_{12} = 1$ (see Fig. 4). It is overplotted for comparison in Fig. 9,
panel (a). In the data, there is no clear change in the percentiles at
the radius where $\omega = 1$ (for $\Gamma_{12} = 1$) and even at the lowest radii
considered here, the highest percentiles do not reach the minimum
optical depth. In contrast, the presence of the ionizing photons from
the QSO already strongly affects the optical depth percentiles in
mock spectra. Thus, the addition of a density structure is required to
counterbalance the increase of the ionization rate. This is shown in
panel (b), where we overplot the evolution of optical depth percentiles
in mock spectra including a density structure around the quasar, as
described in Section 4.4 (Fig. 6). This provides a better fit to
the observed evolution. The probability distribution of $\rho/\langle\rho\rangle$ associated
with the Large Programme QSOs is directly recovered through
the procedure described in Section 3. The 2$\sigma$ confidence region is
displayed in Fig. 10, again for $\Gamma_{12} = 1$ and $\gamma = 1.5$ (panel b,
blue region). A uniform density is rejected at the 2$\sigma$ level for $r < 10$
proper $h^{-1}$ Mpc.

This recovered profile can then be compared to the density
profile expected from simulations. For this purpose, we have used the
Millennium simulation (Springel et al. 2005). This dark-matter only
simulation evolved $2160^3$ particles in a box of size 500 $h^{-1}$ Mpc, and
has $\Omega_m = 0.25$ and $\sigma_8 = 0.9$. Since the LP quasars are very luminous,
we extract the averaged density profile around the most massive
halo at redshift $z = 2$ in the simulation. The profile, smoothed
over 2.5 $h^{-1}$ Mpc, is shown as diamonds in Fig. 10. The similarity is
encouraging, in particular the fact that both profiles start to increase
at the same radius, $\sim 10\,h^{-1}$ Mpc.

The effect of varying $\Gamma_{12}$ and $\gamma$ is investigated next. It is reasonable
to assume that $\gamma$ is within 1 and 1.5 (see Eq. 2). Since we
actually recover $(\rho/\langle\rho\rangle)^{2-0.7(\gamma-1)}$, varying $\gamma$ only scales $\rho/\langle\rho\rangle$ in
a logarithmic plot. The effect is negligible compared to statistical
errors. As for $\Gamma_{12}$, we have shown in Section 3 that, for $r \lesssim r_L$,
$\rho/\langle\rho\rangle$ is proportional to $(1/\Gamma_{12})^{1+\beta}$. Therefore, the observed optical
depth percentiles evolution could also be reproduced in mock
spectra with a larger value of $\Gamma_{12}$, which decreases the influence
of the quasar ionizing flux (the radius where $\omega = 1$ is shifted towards
lower distance). The same quality of fit in Fig. 9 (panel c) is
indeed obtained with the evolution of optical depth percentiles in
mock spectra without density structure but with a larger value of
$\Gamma_{12}$ (3 instead of 1). Similarly, the 2$\sigma$ confidence region of $\rho/\langle\rho\rangle$
is shown for $\Gamma_{12} = 3$ in panel (c) of Fig. 10. The recovered density
structure is reduced, and a uniform density can be rejected at the
2$\sigma$ level for $r < 2\,h^{-1}$ Mpc only. Yet, this may as well be explained
by the systematic bias in the recovered structure at small distances
(Fig. 5). Higher values of $\Gamma_{12}$ would result in an underdensity at
small distances. Conversely, a lower value of $\Gamma_{12}$ enhances the recovered
density structure, which is demonstrated for $\Gamma_{12} = 0.3$ in
panel (a) of Fig. 10.
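
The dependence of the recovered density on the assumed $\Gamma_{12}$ can be made explicit with a small sketch. It assumes that the measured shift of $\tau_0$ at a given radius is held fixed, that Eq. (9) has the form $(\rho/\langle\rho\rangle)^{1/(1+\beta)}/(1+\omega)$ with $\omega \propto 1/\Gamma_{12}$, and that $1/(1+\beta) = 2 - 0.7(\gamma-1)$; the function name and arguments are hypothetical.

```python
def rescale_overdensity(rho_ref, omega_ref, gamma12_new,
                        gamma12_ref=1.0, gamma=1.5):
    """Overdensity that reproduces the same tau0 shift when Gamma_12 changes.

    rho_ref   : rho/<rho> recovered assuming gamma12_ref
    omega_ref : (r_L/r)**2 at this radius for gamma12_ref
    """
    exponent = 2.0 - 0.7 * (gamma - 1.0)               # 1/(1+beta)
    omega_new = omega_ref * gamma12_ref / gamma12_new  # omega scales as 1/Gamma_12
    shift = rho_ref ** exponent / (1.0 + omega_ref)    # measured tau0 ratio
    return (shift * (1.0 + omega_new)) ** (1.0 / exponent)
```

Close to the quasar (large `omega_ref`) the result tends to `rho_ref * (gamma12_ref / gamma12_new) ** (1 / exponent)`, i.e. the $(1/\Gamma_{12})^{1+\beta}$ scaling quoted above, while far from the quasar the recovered density is unchanged.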

One may then ask which value of $\Gamma_{12}$ will allow
the observations to be consistent with a uniform density $\rho/\langle\rho\rangle = 1$.
This corresponds to the standard proximity effect applied to optical
depth statistics. If one requires that a uniform density is not
rejected at more than 2$\sigma$ within each bin in distance, $\Gamma_{12}$ is constrained
to be within the range 3.6-15. This is consistent with the
range of estimates obtained from standard proximity effect analysis
using line counting statistics ($\Gamma_{12}$ = 1.5-9). We could also
assume the density profile based on the Millennium simulation to
recover $\Gamma_{12}$. In this case, Fig. 10 shows that $0.3 \lesssim \Gamma_{12} \lesssim 3$.



6 CONCLUSION 

In this article we presented a method to probe the density structure
around quasars, using a new analysis of the proximity effect in
absorption spectra of quasars. In the vicinity of the quasar, the additional
ionizing photons increase the total ionizing rate, which decreases
the Lyman-$\alpha$ absorption. Simultaneously, an increase of the
density around the quasar (as expected from biased galaxy formation)
would increase the absorption. Both effects are better probed
with the optical depth than directly with the flux. Our method also
avoids fitting the individual absorption lines, and directly uses the
cumulative distribution of Lyman-$\alpha$ optical depths observed in each
pixel. We then model the change of this distribution under modification
of the density field and the amplitude of the ionizing rate,
$\Gamma_{12}$. Our method therefore allows one, in principle, to estimate the
density enhancement around the host galaxies of quasars, once $\Gamma_{12}$ is
fixed by some other method.

We first use a $\Lambda$CDM high resolution simulation to validate our
method. The information on $\Gamma_{12}$ and the density field is accurately
recovered. This gives us confidence to perform our analysis on the
real data. We then use the spectra of the 12 quasars with the highest
luminosity at $2.2 < z < 3.3$ from the ESO-VLT Large Programme.

Our method has revealed the presence of an overdensity for
$2 \lesssim r \lesssim 10$ proper $h^{-1}$ Mpc, assuming $\Gamma_{12} < 3$. We have shown
that it is consistent with the density profile around the most massive
halo at redshift $z = 2$ in the Millennium simulation for $\Gamma_{12} = 1$
(Fig. 10). In the future, a similar analysis should be done with a
larger sample of spectra, covering different redshift and luminosity
ranges. Together with synthetic density profiles computed around
halos of different mass in a large simulation such as the Millennium
one, this will be very useful to better understand the relation between
the environment of the quasar and its host galaxy, and their
evolution with redshift. New constraints could also be put on the
mass-luminosity relation.

Without knowledge of Γ12, and due to the limited statistics, we could
not rule out a uniform density profile. Indeed, consistently with
standard proximity effect analyses, the observations can also be
modelled without density enhancement, assuming a higher value of Γ12.
Yet, due to the specific scaling of the density profile with Γ12, a
larger sample could already allow us to distinguish between different
types of profiles, from a simple power law to the existence of
alternate shells corresponding to over- and underdense regions. This
would be valuable to test the presence of winds, or other specific
feedback effects. Thus it is important to confirm our tentative
finding of a density enhancement around QSOs (for Γ12 < 3) at a high
significance level using a bigger sample.

Another application of this analysis concerns the transverse proximity
effect. The modelling of the observations obtained with Lyman break
galaxies or quasars has been done either with simulations (Croft 2004;
Maselli et al. 2004) or with an analytical model for the density
(Schirber et al. 2004). These works could not reproduce the amplitude
of the observed effect with normal properties of the quasar, such as
anisotropy of the beaming and variability. Combining the constraints
on the optical depth evolution along and transverse to the line of
sight could be a way to disentangle the different parameters, that is
the density structure, Γ12 and the properties of the quasar.

APPENDIX

A key assumption in this paper is that the PDF of the scaled optical
depth, P(τ(z)/τ0(z)), varies little with redshift. Here,
τ0(z) ∝ (1 + z)^α is a redshift-dependent scaling function, with
α ~ 4-5. We showed in Fig. 3 that this is true for the full optical
depth in mock spectra in the range 0.1 < τ/τ0 < 100 and in Fig. 8 for
the censored, recovered optical depth in the range 0.1 < τ/τ0 < 2.5,
both at the reference redshift z = 2.25. We illustrate in Fig. 11 the
limitation of this assumption, by showing the scaled P(τ/τ0) over a
larger range. As expected, the PDF becomes wider in its tails as the
density field becomes increasingly non-linear at lower redshifts.
However, in the range in which we use the PDF, τ_min < τ < τ_max, this
dependence is very weak indeed. It also becomes clear from this figure
that we cannot reliably determine the shape of the PDF around the
maximum for the signal-to-noise ratio in the LP quasars, even at the
higher redshifts z ~ 3. This is also clear from Eq. (2):
τ ~ 0.07 < τ_min at the typical volume-averaged overdensity Δ = 1/3 at
z = 3, when τ0 ~ 0.7. Uncertainties associated with continuum fitting
make this part of the PDF uncertain, in addition to these
signal-to-noise issues. Note that in our previous analysis we used the
recovered optical depth from mock samples, which were continuum-fitted
to mimic observed samples. This will strongly affect the shape of the
PDF at these low values, and therefore there is little to gain from
taking these lower optical depths into account in the present
analysis. In contrast, the mock PDF is uncertain at high τ, where it
becomes sensitive to the lack of self-shielding and to other numerical
uncertainties in high density regions. Given these limitations, can we
understand the shape of the optical depth PDF in the intermediate
regime?
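
The two numerical statements in this paragraph, the redshift scaling
of τ0 and the estimate τ ~ 0.07 at Δ = 1/3, can be reproduced with a
few lines. This is a minimal sketch, not the code used for the
analysis: it assumes that Eq. (2) has the form τ = τ0 Δ^(1/(1 + β))
with 1 + β = 0.5, and the function names, the reference normalisation
and the default α = 5 are illustrative choices.

```python
def tau0_of_z(z, tau0_ref, z_ref, alpha=5.0):
    """Scaling function tau0(z) proportional to (1 + z)**alpha,
    normalised to tau0_ref at z_ref (both illustrative inputs)."""
    return tau0_ref * ((1.0 + z) / (1.0 + z_ref)) ** alpha


def tau_from_overdensity(delta, tau0, one_plus_beta=0.5):
    """Assumed form of Eq. (2): tau = tau0 * delta**(1 / (1 + beta))."""
    return tau0 * delta ** (1.0 / one_plus_beta)


def scaled_tau(tau, z, tau0_ref, z_ref, alpha=5.0):
    """Pixel optical depth divided by tau0 at the pixel redshift."""
    return tau / tau0_of_z(z, tau0_ref, z_ref, alpha)


if __name__ == "__main__":
    # Typical volume-averaged overdensity at z = 3, with tau0 ~ 0.7.
    tau = tau_from_overdensity(1.0 / 3.0, tau0=0.7)
    print(f"tau at Delta = 1/3: {tau:.3f}")
```

With these assumptions the result is τ ≈ 0.08, of the same order as
the value quoted above and below the lower limit of the censored
range.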

Miralda-Escudé, Haehnelt and Rees (2000) provide physical motivation
for the following fitting function for the (volume-weighted, real
space) overdensity Δ,

\[ P(\Delta)\,d\Delta = A \exp\left[ -\frac{(\Delta^{-2/3} - C_0)^2}{2\,(2\delta_0/3)^2} \right] \Delta^{-\beta}\,d\Delta \qquad (11) \]

Their Table 1 provides values for A, C0, δ0 and β at redshifts
z = 2, 3, 4 and 6, which they obtained from fitting their numerical
simulations. The exponent guarantees that the PDF is a Gaussian in
Δ - 1 when C0 = 1 and the dispersion δ0 ≪ 1.
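
The Gaussian limit can be checked with a first-order expansion. The
following is a short sketch using only the symbols of Eq. (11), and
assuming |Δ - 1| ≪ 1 so that higher-order terms can be dropped:

```latex
\begin{align}
\Delta^{-2/3} - C_0
  &\simeq 1 - \tfrac{2}{3}(\Delta - 1) - C_0
   = -\tfrac{2}{3}(\Delta - 1)
   \quad \text{for } C_0 = 1, \\
\frac{(\Delta^{-2/3} - C_0)^2}{2\,(2\delta_0/3)^2}
  &\simeq \frac{\tfrac{4}{9}(\Delta - 1)^2}{2 \cdot \tfrac{4}{9}\,\delta_0^2}
   = \frac{(\Delta - 1)^2}{2\,\delta_0^2}.
\end{align}
```

The exponential therefore reduces to exp[-(Δ - 1)²/(2 δ0²)], a
Gaussian of dispersion δ0 in Δ - 1. When δ0 ≪ 1 the factor Δ^(-β)
stays close to unity over the narrow range where the Gaussian is
non-negligible.
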
We can use this as an Ansatz for the PDF of τ, given the relation
Eq. (2) between Δ and τ. We expect the exponent in the exponential to
change from -2/3 to -2(1 + β)/3, and 1 + β = [0.5, 0.6] for
γ = [1, 1.6], hence we fit

\[ P(x)\,dx = A \exp\left[ -\frac{(x^{-2\nu/3} - C_0)^2}{2\,(2\delta_0/3)^2} \right] dx \qquad (12) \]

where x = log(τ/τ0), with free parameters ν ≡ 1 + β, C0, δ0 and μ,
and A a normalisation constant. Restricting the fit to -2 < x < 1, we
show the best fitting PDFs in Fig. 12 and provide the best fitting
parameters in Table 2. The best fitting value for C0 is kept constant.
The dispersion δ0 differs significantly from the best fitting one to
the density PDF, but the value of the exponent ν is close to that
expected.

Table 2. Best fitting parameters (Eq. 12) for the PDF of scaled
optical depth, within different redshift bins, restricting the fit to
-2 < log(τ/τ0) < 1 (thin lines in Fig. 12).

Figure 11. PDF of the true, scaled optical depth, τ/τ0, of a large
sample (20) of mock LP quasars, in the redshift ranges [1.7, 1.9],
[1.9, 2.1], [2.15, 2.35] and [2.85, 3.05]. A redshift scaling
τ0 ∝ (1 + z)^5 is assumed pixel by pixel; the mean redshift is
indicated in the panel. Limits in optical depth for the censored PDFs
are indicated by thin vertical lines (with corresponding line types).
The PDFs have a Gaussian shape, with a more extended power-law tail
toward low as well as higher optical depths. The shape of the scaled
PDF is almost independent of redshift over nearly three decades, in
-1 < log(τ/τ0) < 2.

These fits are overlaid on the censored PDF of the observed LP quasar
sample in Fig. 13. The good agreement suggests that the mock sample is
indeed representative of the observed distribution.

Figure 12. Fits of the form Eq. (12) (full lines) to the scaled PDFs
shown in Fig. 11, represented here by the histograms. The fits shown
by the thin lines restrict the fitted region to that of the censored
optical depth (vertical lines). The different redshift ranges
indicated in the panel are offset vertically and horizontally by 0.05
and 0.1 respectively, for clarity. The fitting function does
reasonably well around the maximum and in the power-law tail toward
higher τ, but is not able to fit the more non-linear parts at very
high and very low τ. The fit to the censored optical depth (thin
lines) does not recover well the PDF around the maximum.

Figure 13. Overlay of the fits to the scaled PDF of the mock sample
from Fig. 11 on the (censored) scaled PDF of the LP quasar sample. The
same redshift scaling τ0 ∝ (1 + z)^5 is assumed for the LP data. The
same redshift ranges are indicated and shifted as in Fig. 12. The
agreement is very good, increasing our confidence that the mock
samples are sufficiently realistic for validating our method.



